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BACKGROUND OF THE INVENTION 
[00041 Voice communication systems have incorporated many new techniques to 
improve speech quality. One of these techniques involves the use of pulse code 
modulation (PCM) of voice or speech signals. For example, the ITU-T G.711 standard 
may be employed to digitize and encode voice frequencies using one or more variants of 
PCM. Complementary codecs are utilized at the transmitter and receiver to perform such 
pulse code modulation (PCM). 

[00051 Prior to transmission at the transmitter, many voice communication systems 
typically employ linear G.711, ^-law G.711, or A-law G.711 types of pulse code 
modulation to a speech or voice waveform. When a voice waveform is digitized by way 
of such pulse code modulation and transmitted by a transmitter, a receiver must 



appropriately decode flie modulation to order to regenerate the signal transmitted &om 
the transmitter. The received signal is typically aDSO channel transmitting a digitized 64 
kilobit/second sampled PCM signal. 

[00061 Often, a newly implemented voice communication system or an existing 
problematic voice communication system may need to be diagnosed and tested at one or 
more points within the system. One of the problems that may be encountered during 
testing of such a communication system may relate to whether a proper PCM codec is 
utilized at the receiver. If the PCM codec at the receiver does not employ the 
corresponding decoding algorithm used by the PCM codec at the transmitter, voice 
quality may suffer because the received voice signal was improperly decoded. 
[00071 Furthermore, the inability to efficiently diagnose codec related performance issues 
may lead to undue testing of other subsystems within the communication system. Tins 
often results in system downtime and additional labor costs. 

[00081 Further limitations and disadvantages of conventional and traditional approaches 
will become apparent to one of skill in the art, through comparison of such systems with 
some aspects of the present invention as set forth in the remainder of the present 
application with reference to the drawings. 

BRIEF SUMMARY OF THE INVENTION 
[00091 Aspects of the invention provide a method and system to detect or identify one or 
more types of algorithms used in the encoding of a voice or speech waveform. The 
system and method may be used as a testing tool to identify whether a voice data stream 



was encoded using linear G.711, ^-law G.711. or A-law G.711 pulse code modulation 
(PCM) algorithms. 

[00101 In one embodiment, a method is used to identify a type of encoding used in 
generating a voice data stream comprising reading words from a voice data stream, 
generating at least one parameter using the words and determining a format in which the 
words are encoded from a plurality of possible formats. 

[00111 In one embodiment, a method of identifying a type of encoding used in generating 
a voice data stream incorporates reading words of the voice data stream, detemiining a 
first number of words of the voice data stream that corresponds to a first range of values, 
determining a second number of words of the voice data sfream that corresponds to a 
second range of values, generating ^-law linear equivalents of the one or more words of 
the voice data stream, determining a third number of words corresponding to the m-law 
linear equivalents of the one or more words that have values within a third range, 
determining a fourth number of words corresponding to the m-law linear equivalents of 
the one or more words that have values within a fourth range, generating A-law linear 
equivalents of the one or more words of the voice data stream, determining a fifth number 
of words using corresponding to the A-law linear equivalents of the one or more words 
that have values within a fifth range, and determining a sixth number of words 
corresponding to the A-law linear equivalents of the one or more words that have values 
within a sixth range. 
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[00121 In one embodiment, a system for identifying a type of encoding used in 
generating a voice data stream includes a processor, a memory, a storage device, and a set 
of computer instructions residing in the storage media. 

100131 These and other advantages, aspects, and novel features of the present invention, 
as well as details of illustrated embodiments, thereof, will be more fully understood from 
the following description and drawings. 



4 



BRIEF DESCRIPTION OF THE DRAWINGS 
[0014] Figure 1 is a block diagram of a G.711 detection system in accordance with an 

embodiment of the invention. 

[00151 Figures 2A and 2B are operational flow diagrams illustrating a sequence of steps 
used to characterize the words in a received voice data stream in accordance with an 
embodiment of the invention. 

[0016] Figures 3 A and 3B are operational flow diagrams illustrating a sequence of steps 
used to characterize the words in a received voice data stream in accordance with an 
embodiment of the invention. 

[00171 Figure 4 is an operational flow diagram illustrating a calculation of a number of 
parameters that are used in determining the type of G.711 encoding represented by the 
voice data stream file in accordance with an embodiment of the invention. 
[00181 Figure 5 is an operational flow diagram illustrating a sequence of N tests 
performed on a voice data stream file. 
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DETAILED DESCRIPTION OF THE INVENTION 
[00191 Aspects of the present invention may be found in a system and method to detect 
or identify one or more types of algorithms used in the encoding of a voice or speech 
waveform. The system and method may be used as a testing tool to identify whether a 
voice data stream is encoded using one or more pulse code modulation (PCM) 
compression algorithms defined by ITU (hitemational Telecommunications Union) 
G.71 1 recommendation specification. The system and method may be applied to a voice 
data stream comprising a number of bytes of data that has been previously stored as a 
data file. The one or more types of algorithms may comprise a 16 bit linear (in some 
instances described as uniform PCM or linear G.711), ^-law G.711, and A-law G.711 
types of pulse code modulation (PCM) algorithms. The system and method characterize 
the voice data stream in terms of one or more parameters that correlate with linear G.71 1, 
m-law G.711, or A-law G.711. Thereafter, the parameters are analyzed by way of one or 
more tests to determine which algorithm was used to encode the voice data stream. 
[0020] The system and method are appUed to a voice data stream in order to ensure that a 
codec that employs the proper decoding algorithm is used to reproduce the audio 
waveform that was transmitted. The system comprises a set of computer instructions or 
software, which resides in a computing device. The aforementioned set of computer 
instructions or software will be termed a G.71 1 detection software. The G.71 1 detection 
software may be generated using a computer language. In one embodiment, the G.711 
detection software may be generated using the C/C++ language. The G.711 detection 
software is executed by way of the computing device. The computing device will be 
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described, hereinafter, as a G.711 detection system. The G.711 detection software 
operates on a stream of data that represents an encoded speech sample. The encoded 
speech sample may comprise a stream of data bytes or words output by a transmit codec 
of a transmitter. In one embodiment, the stream of bytes may correspond to one or more 
utterances or one or more phrases spoken in one or more languages. 
[0021] Figure 1 is a block diagram of a G.71 1 detection system in accordance with an 
embodiment of the invention. TTie G.711 detection system 100 comprises a processor 
104 comiected to a memory 108, a hard drive 110. a media reader 112, a network 
interface 116, a monitor 120, a speaker 124, and a user interface 128. Also shown, as 
residing within the hard drive 1 10, is a G.71 1 detection software 132. The hard drive 1 10 
acts as an exemplary storage device. However, the invention is not so limited, and the 
G.711 detection software may reside in other storage devices, such as, for example, the 
memory 108 or in memory internal to the processor 104. The processor 104 executes the 
G.711 detection software 132 to perform detection or identification of one or more voice 
data streams. In one embodiment, the G.711 detection software 132 may be stored and 
executed at a server that is communicatively coupled to the G.711 detection system 100 
by way of its network interface 116. The server may store the G.711 detection software 
132 until the G.711 detection software 132 is required by the G.711 detection system 
100. The processor 104 may utilize its memory 108 to efficiently process and/or execute 
the G.711 detection software 132 residing in the hard drive 110. The memory 108 may 
comprise a random access memory. The voice data stream may be stored as a voice data 
file in the hard drive 1 10 or media reader 112 until it is used by the processor 104. The 



voice data stream file may comprise an exemplary <filename>.pcm type of data file. The 
<filename>.pcm file may be transmitted to the hard drive 1 10 firom the media reader 112 
or the network interface 116, as shown. The hard drive 110 may store the 
<filename>.pcm file when processing is performed by the processor 104. The media 
reader 112, may, for example, comprise a CD-ROM, fioppy disk drive, magnetic drive, 
portable USB drive, and the like. The media reader 112 is used to read one or more 
portable media inserted into the media reader 112 containing the voice data stream file. 
The network interface 116 may allow receipt of the exemplary <filename>.pcm data file 
from a computing device located in a local area network (LAN). Execution of the G.711 
detection software 132 may be accompUshed, for example, by control provided by a user 
interface 128. The user interface 128 may comprise a keyboard or mouse or other input 
device. The monitor 120 and speaker 124 are used to provide visual and audio feedback 
to a user of the G.711 detection system 100. In one embodiment, the G.711 detection 
system 100 may comprise a workstation or a server. 

[0022] Figures 2A and 2B are operational flow diagrams illustrating the sequence of 
steps used to characterize the data words in a voice data stream as if the voice was 
encoded using linear G.71 1. Figures 2A and 2B are in accordance with an embodiment 
of the invention. The received voice data stream is assumed to be a linear G.711 
(alternatively termed as a uniform PCM) data stream in reference to the sequence of steps 
presented in Figures 2A and 2B. It is contemplated that the G.711 detection system 100 
may be configured to process one or more variants of linear PCM. For example, the 



8 



variants may comprise either a Uttle-endian or a big-endian type of linear PCM voice data 
Stream. 

[0023] Referring to Figure 2A, at step 204. the G.711 detection software operates on a 
voice data stream. The voice data stream may comprise real time data comprising a 
certain number of data bytes of words. The voice data stream may comprise voice data 
encoded using linear G.711, ^-law G.711, or A-law G.711 algorithms. In one 
embodiment, the voice data stream comprises a size of 800 kilobytes, lasting 
approximately 100 seconds of audio runtime. At step 208, two counters are reset to zero. 
A first counter, termed a "linear zeros" counter, counts the number of words in the voice 
data stream file whose absolute value is below a first threshold value. The words that are 
within this first threshold value are termed "linear zeros" and correspond to words that 
are characteristic ofaUnear G.711 encoded voice data stream. A second counter, termed 
a "linear overflows" counter, counts the number of words in the voice data stream file 
whose absolute value exceeds a second threshold value. The words that exceed the 
second threshold are termed "linear overflows" and are non-characteristic of linear 
G.711. Tlie counters may be implemented by way of addressable memory registers 
within the G.711 detection system previously described in Figure 1. At step 212, a 
register in a memory of the G.711 detection system is reset to zero. This register is used 
to store a maximum value of all differences calculated between values of successive 
words of the entire voice data stream, and is alternatively termed a "linear maximum 
discontinuity jump register" (LMDJR). At step 216, a word counter that counts the 
number of words read is reset to zero. The word counter may be implemented by way of 



the addressable memory within the G.711 detection system. Next, at step 220, a word is 
read from the voice data stream or voice data stream file. At step 224, the word counter 
is incremented by one. Next, at step 226, the value of the word is determined. For 
example, the value of a binary sequence (0000000011111111) is determined by 
converting it to its decimal equivalent. In this instance, the decimal value is 255. The 
value may correspond to either a zero or an overflow value. At step 228, the first counter 
(or linear zeros counter) is incremented if the absolute value is less than or equal to the 
first threshold value. In one embodiment, the value may be a small number such as the 
exemplary decimal value 5. Next, at step 232, the second counter is incremented if the 
absolute value is greater than the second threshold value. In one embodiment, the second 
threshold value may be a large nmnber such as the exemplary decimal value 25,000. At 
step 236, the LMDJR, is updated, if necessary, by calculating the difference between the 
value of the word currently read and the value of the word previously read. If the 
calculated difference is greater than what is currently stored in the LMDJR, the difference 
replaces the current value stored in the LMDJR. Hence, after all words in a data stream 
file are evaluated by the G.711 detection system, the largest difference between 
successive word values is stored in the LMDJR. At step 240, a decision is made as to 
whether the entire voice data stream has been read. If the entire voice data stream has 
been read, the process illustrated in Figures 2A and 2B ends. Otherwise, at step 244, the 
process reverts back to step 220 where another word is read. 

[00241 Figures 3A and 3B are operational flow diagrams illustrating the sequence of 
steps used to characterize the data words in a received voice data stream as if the voice 



10 



was encoded using either jx-law G.711 or A-law G.711. Figures 3A and 3B are in 
accordance with an embodiment of the invention. The received voice data stream is 
assumed to be representations of either ji-law G.711 or A-law G.711 in reference to the 
sequence of steps represented in Figures 3A and 3B. In summary, the one or more 
methods provided by Figures 3A and 3B characterize the voice data stream in terms of 
parameters such as zeros, overflows, and maximum jump discontinuities. These 
parameters were described earlier in reference to Figures 2A and 2B, when a linear G.71 1 
characterization of a voice data stream was performed. In the embodiment of Figures 3 A 
and 3B, the values for the words or samples of the voice data stream are characterized in 
terms of overflows and zeros after the voice data stream is decoded or converted into its 
H-law or A-law linear equivalents. The voice data stream is decoded using n-law to 
linear or A-law to Hnear algorithms, in order to generate the appropriate ^-law linear 
equivalents or A-law linear equivalents. Thereafter, their respective Unear equivalent 
values are then characterized over one or more different ranges. In the embodiment of 
Figures 3A and 3B, the equivalent values are categorized as an overflow, a zero, or a 
maximum jump discontinuity. 

[0025] Referring to Figure 3A, at step 304, the G.711 detection software operates on a 
voice data stream file. The file may comprise voice data encoded in linear G.71 1, ^i-law 
G.711, or A-law G.711. The file may comprise, for example, a size of 800 kilobytes, 
lasting approximately 100 seconds of audio runtime. At step 308, all overflows and zeros 
counters are reset to zero. There are two pairs of overflows/zeros counters are used in 
associating words that correspond to "zeros" or "overflows" during a ^i-law to linear 
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conversion or an A-law to linear conversion. Next at step 312, both M-law and A-law 
maximum discontinuity jump registers are set to zero. As was described in Figures 2A 
and 2B. a maximum discontinuity jump register (MDJR) is used to determine the largest 
difference between successive linear equivalent values over the entire voice data stream 
or voice data stream file. Thereafter, at step 316, the word counter is set to zero. In this 
embodiment, each word or data sample is defined as one byte, in which one byte 
comprises eight binary digits. At step 320, a word firom the data stream is read and 
converted to its ^i-law and A-law linear equivalents. Next, at step 324, the word counter 
is incremented by one. Now referring to Figure 3B, a histogram of hexadecimal words 
may be generated based on the values read, hi this embodiment, the value of an 
exemplary 8 bit n-law or A-law hexadecimal word corresponds to one of 256 intervals 
within the histogram. The number of bits used to represent an element of the histogram 
may be proportional to the number of data words comprising the voice data stream file. 
For example, 32 bits (corresponding to a maximum count of 232) may be used to 
sufficiently represent an 800 kilobyte (or in this instance an 800 kiloword) voice data 
stream file. The 256 different hexadecimal values implement 256 x-axis intervals in an 
exemplary histogram, while the frequency of occurrence of a particular value is indicated 
on the y-axis of the histogram by way of the 32-bit counter. Hence, at step 328, the 
appropriate intervals in the histogram are updated in terms of their occurrence. At step 
332, the corresponding M-law or A-law overflows counters are incremented if the word 
values exceed their respective thresholds. Optionally, the corresponding ^-law or A-law 
zeros counters may be incremented if the linear equivalents are below their respective 
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thresholds. Alternatively, the number of words with linear equivalents corresponding to 
overflows or zeros values may be determined by summing portions of the histogram 
corresponding to their appropriate m-law or A-law linear equivalents (as will be 
described in Figure 4 with respect to the calculation of the number of zeros). Next at step 
336, the |i-law DJR, is updated, if necessary, by calculating the difference between the 
m-law linear equivalent value of the word currently read and the m-law linear equivalent 
value of the word previously read. If this difference is greater than what is currently 
stored in the ^i-law DJR. the difference is used to replace the value currently stored in the 
m-law MDJR. Hence, after all words in a voice data stream are evaluated by the G.711 
detection system, the largest difference between successive word values is stored in the 
m-law MDJR. Similarly, the A-law MDJR. is updated, if necessary, by calculating the 
difference between the A-law linear equivalent value of the word currently read and the 
A-law linear equivalent value of the word previously read. At step 340. the process ends 
if the entire voice data stream has been read. Otherwise the process advances to step 344. 
At this step, the process reverts back to step 320. allowing another word to be read from 
the voice data stream. 

[00261 Figure 4 is an operational flow diagram illustrating the calculation of a number of 
parameters which are used in determining the type of G.71 1 encoding represented by the 
voice data stream file. At step 404, m-law or A-law words whose linear equivalents 
correspond to "zeros" (termed ^l-law or A-law zeros, hereinafter) may be determined by 
identifying the corresponding intervals in the histogram. For example, the hexadecimal 
values - 0x7f, Oxff. Ox7e, and Oxfe may be identified as one or more intervals in the 
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histogram that correspond to m-law zeros. Adding the occurrences represented by these 
law zero" intervals yields the total number of m-law words in the voice data stream that 
correspond to "m-law zeros". Likewise, the hexadecimal values - 0x55, 0xd5, 0x54, and 
0xd4 may be used to identify appropriate intervals in the histogram corresponding to A- 
law zeros. Adding the occurrences represented by these "A-law zero" intervals yields the 
number of A-law words in the voice data stream that correspond to "A-law zeros". 
Although previously described and implemented in Figures 3 A and 3B using counters, it 
is contemplated that m-law or A-law words whose linear equivalents correspond to 
"overflows" (termed ^-law or A-law overflows, hereinafter) may be determined by 
identifying the appropriate intervals in the histogram and summing the occurrences. 
Next, at step 408, the corresponding percentages are calculated for linear G.711, m-law 
G.71 1 and A-law G.71 1 zeros. For example, the percentage of linear zeros is calculated 
by dividing the number of "linear zeros" by the total number of words in the data stream 
file and then multiplying by 100. Likewise, the percentage of m-law G.711 zeros is 
calculated in a similar fashion. Similarly, the percentage of A-law G.711 zeros is 
calculated. Next, at step 412, the percentages are calculated for the number of linear 
G.71 1, m-law G.71 1, and A-law G.71 1 overflows determined previously. 
[00271 Thereafter, at step 416, the normalized sum of m-law and A-law "zeros" are 
calculated using the following equation: 

zero_mag = (azero_percent + nzeroj3ercent)/100.0, wherein 
zero_mag is defmed as the normalized sum of ^-law and A-law zeros; 
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azero_percent is defined as the percentage of words at A-law zero levels (whose 
absolute value is below a threshold), and 

mzerojjercent is defined as the percentage of words at m-law zero levels (whose 

absolute value is above a threshold). 
[00281 Next, at step 420, the normalized sum of m-law and A-law "overflows" are 
calculated using the following equation: 

ovfl_mag = (aovfl_percent + movfl_percent)/100.0, wherein 

ovfl_mag is defined as the normalized sum of n-law and A-law overflows; 

aovfljjercent is defined as the percentage of words at A-law overflow levels 

(whose absolute value is above a threshold); and 

movfl_percent is defined as the percentage of words at m-law overflow levels 

(whose absolute value is below a threshold). 
[00291 Thereafter, at step 424, the normalized difference between m-law and A-law 
"zeros" are calculated, using the following exemplary equation: 

zero_difF= (abs(azero_percent - nzero_j)ercent)/(azeroj)ercent + jizero_j)ercent 

+ 0.001)), wherein 

zero_diff is defined as the normalized difference between n-law and alaw zeros; 
^zero_percent is defined as the percentage of words at ^-law zero levels (as was 
previously described); and 

azero_percent is defined as the percentage of words at A-law zero levels (as was 
previously described). 

[00301 The value 0.001 is added in the denominator as a safeguard to prevent an instance 
m which the denominator in the quotient is equal to zero. In such an event, the quotient 
is equal to infinity and the value of ovfl_diff may not be acceptable. 
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[00311 At the last step 428, of Figure 4, the normahzed sums of m-law and A-law 
"overflows" are calculated using the following equation: 

ovfl_diff = (abs(novfl_percent - aovfl_percent)/(^ovfl_percent + aovfljpercent + 
0.001)), wherein, 

ovfl_difr is defined as the normalized difference between n-law and A-law 
overflows; 

HOvfl_percent is defined as the percentage of words at ^-law overflow levels; and 
aovfl_percent is defined as the percentage of words at A-law overflow levels. 
[00321 After the parameters described in Figure 4 are calculated, a series of tests are 
successively performed during execution of the G.711 detection software to determine 
whether the voice data stream words represents a linear G.711, a ^-law G.711, or an A- 
law G.711 representation. 

[00331 Figure 5 is an operational flow diagram illustrating a sequence of N tests 
performed on a voice data stream file. The tests are applied successively in order to 
determine if the voice data stream file under test by the G.71 1 detection system is in fact, 
a representation of linear G.71 1, ^-law G.711, A-law G.711, or an unknown data stream 
based on the criterion or parameters used by the G.711 detection software within the 
G.71 1 detection system. The number of tests performed by the G.71 1 detection system 
may vary based on the characteristics of the voice data stream file. In the following 
embodiment, a maximum of ten tests may be performed in succession. The tests are 
performed successively until a test determines an outcome. If a test results in no 
outcome, the next test is performed until an outcome is generated or until the last test is 
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performed. The variables / constants used in the following exemplary tests are defined as 
follows: 

H_maxjump is defined as the maximum ^-law jump discontinuity; 
a_maxjump is defined as the maximum A-law jump discontinuity; 
l_maxjump is defined as the maximum linear jump discontinuity; 
lovfl_percent is defined as the percentage of words at linear overflow levels; 
^ovfl_percent is defined as the percentage of words at n-law overflow levels; 
aovfl_percent is defined as the percentage of words at A-law overflow levels; 
ovfl_mag is defined as the normalized sum of ^i-law and A-law overflows; 
uzero_percent is defined as the percentage of words at ulaw zero levels; 
azero_percent is defined as the percentage of words at alaw zero levels; 
lzero_percent is defined as the percentage of words at linear zero levels; 
JUMP_MAX = 40000 (Threshold for max jump for any sample to sample); 
JUMP_DIFF = 20000 (Threshold for linear/^-law/A-law max jump differences); 
THR LIN_OVFL_PERCENT = 0.01 (linear overflows below this % threshold 
are significant); 

THR_UA_OVFL_PERCENT = 0.5 (^-law/A-law overflows above this % 
threshold are significant); 

THR_LIN_ZERO_PERCENT = 50 (linear zeros above this % threshold are 
significant); 

THR OVFL DIFF = 0.25 (overflow difference threshold); 
THR OVFL_MAG = 0.02 (overflow magnitude threshold) 
THR ZERO DIFF = 0.75 ((n-law to A-law zero difference threshold) 
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THR_ZERO_MAG = 0.10 (zero magnitude threshold). 
[0034] Referring to Figure 5, at step 504, the G.711 detection software initiates the start 
of a new testing sequence by setting N=l. The variable N is an indicator of which test is 
being executed by the G.71 1 detection software. At step 508, the first test (N=l. Test #1) 
is performed. During the course of the first test, a number of decisions are made by the 
first test based on one or more parameters calculated previously. For example, at step 
512, the first test may determine whether the voice data stream file being tested 
represents linear G.711 file. If the test determmes that the voice data stream is linear 
G.71 1, it returns an appropriate message such as "Return Linear G.71 1". At step 516, the 
first test may determine whether the voice data stream file represents m-law G.711 file. 
If the test determines that the voice data stream is m-law G.71 1, it returns an appropriate 
message. At step 520, the first test may determine whether the voice data stream 
represents an A-law G.711 file. Next, at step 524, the first test may determine that the 
voice data stream is not characteristic of linear, m-law, or A-law G.711. As a 
consequence, the first test may generate an "unknown" response. Otherwise, at step 528, 
the process proceeds to the next test. At step 532, N is incremented by one, so N=2, and 
the testing process reverts to step 508 with the second test being performed. Similarly, 
the testing process continues until a decision is made by a test or until the last test is 
completed. The following ten tests may be performed sequentially to determine the type 
of G.711 represented by a voice data stream file. The embodiments provided by the 
following ten tests are exemplary, and it is contemplated that other similar tests may be 
implemented using the parameters previously determined in Figures 2 through Figure 5. 
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[0035] The first test determines if both a ^-law maximum jump discontinuity and an A- 
law maximum jump discontinuity are greater than a first threshold. In addition, the test 
determines if a difference between the n-law maximum jump discontinuity and a linear 
maximum jump discontinuity is greater than a second threshold. Furthermore, the test 
determines if a difference between an A-law maximum jump discontinuity and the linear 
maximum jump discontinuity is greater than the second threshold. Then, the first test 
verifies if a normalized sum of m-law and A-law "overflows" is above a third threshold, a 
percentage of linear overflows is less than a fourth threshold, a percentage of fi-law 
overflows is greater than a fifth threshold, and a percentage of A-law overflows is greater 
than the fifth threshold. If all these conditions are satisfied, the G.71 1 detection software 
determines that the voice data stream file is linear G.711. For example, a software 
program such as a C/C++ program may comprise the following high level language 
instructions to implement this particular test, in which exemplary threshold values for 
JUMP_MAX, JUMP_DIFF, THR_LIN_OVFL_PERCENT, THR_OVFL_MAG, 
THR_UA_OVFL_PERCENT, and THR_UA_OVFL_PERCENT were defined 
previously. 

[0036] if ((n_maxjump > JUMP_MAX) && (a_maxjump > JUMP_MAX) 
&& ((umaxjump - l_maxjump) > JUMP_DIFF) 
&& ((a_maxjump - l_maxjump) > JUMP_DIFF)) 
{ 

if ((lovfl_percent < THR_LnSf_OVFL_PERCENT) && (ovfl_mag 
>THR OVFL_MAG) 
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&& (uovfl_percent > THR_UA_OVFL_PERCENT) && 
(aovfl_percent > THR_UA_OVFL_PERCENT)) 

{ 

return RC_LINEAR; 

} 

} 

[0037] The second test determines if a percentage of linear zeros is above a particular 
threshold and if a percentage of n-law zeros and if a percentage of A-law zeros are both 
below the same threshold. If all these conditions are satisfied, the G.711 detection 
software determines that the voice data stream is linear G.71 1. For example, a software 
program such as a C/C-H+ program may comprise the following high level language 
instructions to implement this particular test: 

if ((lzero_percent > THR_LIN_ZERO_PERCENT) 

&& (azero_percent < THR_LIN_ZERO_PERCENT) 
&& (uzero_percent < THR_LIN_ZERO_PERCENT)) 

{ 

return RC_LINEAR; 

} 

[00381 The third test determines whether M-law or A-law was used to encode the voice 
data stream file. The test determines if the Maw and A-law zeros and overflows 
percentages are significantly different. For example, the G.711 detection system 
calculates whether a normalized difference between the ^-law and A-law overflows is 
greater than a normalized overflows difference threshold and a normaUzed difference 
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between the ^-law and A-law zeros is greater than a normahzed zeros difference 
threshold. If the ^-law / A-law zeros and overflows are significantly different, the G.71 1 
detection system determines if the number of ^-law overflows is greater than the number 
of A-law overflows and the A-law zero percentage is greater than the ^-law zero 
percentage. If so, then the G.711 detection system determines that the voice data stream 
file is A-law G.711. If not, the G.711 detection system determines whether the number 
of A-law overflows is greater than ^-law overflows and that percentage of ^-law zeros is 
greater than the percentage of A-law zeros. If so, then the G.711 detection system 
determines that the voice data stream file is n-law G.711. For example, a software 
program such as a C/C++ program may comprise the following high level language 
instructions to implement this particular test: 

if ((ovfl_diff > THR_OVFL_DIFF) && (zero_diff > 
THR_ZERO_DIFF)) 

{ 

if ((u_overflows > a_overflows) && (azerojsercent > 
uzero_percent)) 

{ 

return RC_ALAW; 

} 

else if ((a_overflows > u_overflows) && (uzero_percent > 
azero_percent)) 

{ 

return RC_ULAW; 
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} 

} 

[0039] The fourth test checks to see if there are no ^-law or A-law overflows before 
using M-law or A-law zeros percentages to determme an outcome. Then, the test 
determines if an A-law zeros percentage is greater than a n-law zeros percentage. If so, 
the test returns an A-law G.711 decision. If the test subsequently determines if the n-law 
zeros percentage is greater than the A-law zeros percentage, a ^i-law G.711 decision is 
returned. If either ^-law or A-law G.711 decision is not determined, the fourth test 
returns "unknown" as a decision. For example, a software program such as a C/C++ 
program may comprise the following high level language instructions to implement this 
particular test: 

if (!a_overflows && !u_overflows) /* No overflows or underflows */ 
{ 

int re = zeroCheck(azero_percent, uzero__percent); 
if (rc != RC_UNKNOWN) 
return rc; 

wherein a helper function is invoked to determine zeroCheck as shown below: 
int zeroCheck(double azero_percent, double uzero_percent) 
{ 

if (azero_percent > uzero_percent) 
{ 

return RC_ALAW; 
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} 

if (uzero_percent > azero_percent) 
{ 

return RC_ULAW; 

} 

return RC_UNKNOWN; 

} 

[0040] The fifth test determines if a normalized sum of the ^-law and A-law zeros is 
greater than a first threshold. The test subsequently determines if a normalized difference 
of the A-law and ^i-law zeros is greater than a second threshold. In addition to this 
condition, a normalized sum of the n-law and A-law overflows and a normalized 
difference between the and A-law overflows must both be less than a third 

threshold and fourth threshold, respectively. If all of the previously described conditions 
are satisfied, the G.711 detection system invokes the zeroCheck helper function 
previously described in the fourth test to determine whether ^i-law zeros percentage or A- 
law zeros percentage is greater. The fifth test returns a decision based on this helper 
fimction. For example, a software program such as a C/C++ program may comprise the 
following high level language instructions to implement this particular test: 

if ((zero_mag > THR_ZERO_MAG) && (zero_di£f > 
THR_ZERO_DIFF)) /* zeros significant */ 

{ 

if ((ovfl_mag < THR_OVFL_MAG) && (ovfl_diff < 
THR_OVFL_DIFF)) /* overflows insignificant */ 
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{ 

int rc = zeroCheck(azero_percent, uzero_percent); 
if (rc != RC_UNKNOWN) return rc; 

} 

} 

10041] The sixth test assesses whether a normalized sum of the ^-law and A-law 
overflows is greater than a first threshold and if a normalized difference of the ^-law and 
A-law overflows is greater than a second threshold. If these two conditions are satisfied, 
then an assessment is made if a normalized difference between the ^i-law and A-law 
zeros is less than a third threshold. If the third condition is satisfied, the G.711 detection 
system invokes an overflowCheck helper function to determine whether the n-law 
overflows percentage or the A-law overflows percentage is greater. The sixth test returns 
a decision based on this helper fimction. For example, a software program such as a 
C/C++ program may comprise the following high level language instructions to 
implement this particular test: 

if ((ovfl_mag > THR_OVFL_MAG) &«& (ovfl_diff > 
THR_OVFL_DIFF)) /* ovflow significant */ 

{ 

if (zero_diff < THR_ZERO_DIFF) /* zeros insignificant */ 
{ 

int rc = ovflCheck(aovfl_percent, uovfl_percent); 
if (rc != RC_UNKNOWN) return rc; 

} 
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} 

wherein a helper function is invoked to determine ovflCheck as shown below: 
{ 

if (aovfl_percent > uovfl_percent) 
{ 

return RC_ULAW; 

} 

if (uovfl_percent > aovfl _percent) 
{ 

return RC_ALAW; 

} 

return RC_UNKNOWN; 

} 

[0042] The seventh test assesses if a normalized sum of the ^i-law and A-law zeros is 
greater than a first threshold and a normalized differences of the n-law and A-law zeros is 
greater than a second threshold. If so, the G.711 detection system invokes the zeroCheck 
helper function previously described in the fourth test to determine whether (i-law zeros 
percentage or A-law zeros percentage is greater. The seventh test returns a decision 
based on this helper function. For example, a software program such as a C/C++ 
program may comprise the following high level language instructions to implement this 
particular test: 
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if ((zero_mag > THR_ZERO_MAG) && (zero_diff > 
THR_ZERO_DIFF)) 

{ 

int rc = zeroCheck(azero_percent, uzero_percent); 
if (rc != RC_UNKNOWN) return rc; 

} 

[0043] The eighth test assesses whether a normalized sum of the n-law and A-law 
overflows is greater than a first threshold and whether a normalized difference of the ^i- 
law and A-law overflows are greater than a second threshold. If both are significant, the 
G.711 detection system invokes the overflowCheck helper fimction, as was previously 
described, to determine whether the H-law overflows percentage or the A-law overflows 
percentage is greater. The eighth test returns a decision based on this helper function. 
For example, a software program such as a C/C-H- program may comprise the following 
high level language instructions to implement this particular test: 

if ((ovfl_mag > THR_OVFL_MAG) && (ovfl_diff > 
THR_OVFL_DIFF)) 

{ 

int rc = ovflCheck(aovfl_percent, uovfl_percent); 
if (rc != RC_UNKNOWN) return rc; 

} 

[0044] The ninth test assesses whether an A-law maximum discontinuity jump is greater 
than a first threshold and whether an absolute value of the difference between the A-law 
maximum discontinuity jump and a ^i-law maximum discontinuity jump is greater than a 
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second threshold. If both of these last two conditions are satisfied, then the G.711 
detection system generates a ^-law decision. Otherwise, the G.711 detection system 
assesses whether the n-law maximum discontinuity jump is greater than the first 
threshold and whether the absolute value of the difference between the A-law maximum 
discontinuity jump and the ^-law maximum discontinuity jump is greater than the second 
threshold. If both of these last two conditions are satisfied, then the G.711 detection 
system generates an A-law decision. For example, a software program such as a C/C++ 
program may comprise the following high level language instructions to implement this 
particular test: 

if ((a_maxjump > JUMP_MAX) && fabs(a_maxjump - u_maxjump) > 
JUMP_DIFF) 

{ 

return RC_ULAW; 

) 

if ((u_maxjump > JUMP.MAX) && fabs(a_maxjump - u_maxjump) > 
JUMP_DIFF) 

{ 

return RC_ALAW; 

} 

[00451 The tenth test is a combination of two subtests. The first subtest compares the 
normalized difference between ^-law and A-law overflows against two parameters. If 
the normalized difference between n-law and A-law overfiows is greater than twice the 
normalized difference between ^-law and A-law zeros while the normaUzed difference 
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between ^-law and A-law overflows is greater than a first threshold, then the G.711 
detection system invokes the ovflCheck helper function previously described to 
determine whether the n-law overflows percentage or the A-law overflows percentage is 
greater. The second subtest compares a normalized difference between n-law and A-law 
zeros versus twice a normalized difference between ^-law and A-law overflows while 
assessing the normalized difference between n-law and A-law zeros against a second 
threshold. If the normalized difference between n-law and A-law zeros is greater than 
twice the normalized difference between n-law and A-law overflows while the 
normalized difference between ^i-law and A-law zeros is greater than a second threshold, 
then the G.711 detection system invokes the zeroCheck helper function previously 
described to determine whether the ^-kw zeros percentage or the A-law zeros percentage 
is greater. 

{ 

int rc = ovflCheck(aovfl_percent, uovfl_percent); 
if (rc != RC_UNKN0W1SI) return rc; 

} 

if ((zero_diff > (2 * ovfl_diff)) && (zero_difif > THR_ZERO_DIFF)) 
{ 

int rc = zeroCheck(azero_percent, uzero_percent); 
if (rc != RC_UNKNOWN) return rc; 

} 
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[0046] The following computer output is generated by an exemplary G.711 detection 
system that executes the exemplary G.711 detection software. The G.711 detection 
software operates on an exemplary file named ingress.pcm: 

Processing iodump_raw2096_Called_bos_ingress.pcm... 

bytes=l 148400 

words=574160 

u_overflows=l 7627 

a_overflows=0 

lin_overflows=16734 

threshold = +/-25000 

alaw maxjump = 1 1776 

ulaw maxjump = 64248 

lin maxjump = 64136 

alaw zeros = 93.69% 

ulaw zeros = 0.04% 

lin zeros = 0.03% 

alaw overflows = 0.00% 

ulaw overflows = 1.53% 

lin overflows = 2.91% 

overflow magnitude (0-1) = 0.02 

zero magnitude (0-1) = 0.94 

overflow difference (0-1) = 1.00 

zero difference (0-1) = 1.00 
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ingress.pcm is ALAW 

[00471 As illustrated by the preceding output, the samples or words in the voice data 
stream file are characterized by a substantial number of A-law zeros. The values of these 
words, after converting from A-law to linear are analyzed and those words that exceed a 
particular threshold value are categorized as overflows while those that fall below a 
particular threshold are classified as zeros. In this particular data stream file, the 
percentage of A-law zeros far exceeds the percentage of ^-law zeros or linear zeros. 
Referring to the output above, the percentage of A-law zeros is 93.69% while the M-law 
and linear zeros are negligible. Another parameter of significance is the maximum 
discontinuity jump associated with values of successive words in either the linear, A-law, 
or M-law case. As illustrated in the output, the maximum discontinuity jump associated 
with the A-law case is the smallest among the three possible cases. The maximum 
discontinuity jump associated with A-law is 11,766 compared with approximately 64,000 
for the other two cases, indicating that a voice data stream decoded using A-law G.711 
results in values that are more reasonable than the same voice data stream decoded using 
either m-law G.711 or linear G.711. Hence, as illustrated by the last line of the output, 
the data stream file has been determined to be encoded using A-law (i.e., the data file is a 
representation of A-law). 

[0048] While the invention has been described with reference to certain embodiments, it 
will be understood by those skilled in the art that various changes may be made and 
equivalents may be substituted without departing from the scope of the invention. In 
addition, many modifications may be made to adapt a particular situation or material to 
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the teachings of the invention without departing from its scope. Therefore, it is intend* 
that the invention not be Hmited to the particular embodiment disclosed, but that tl 
invention will include all embodiments falling within the scope of the appended claims. 
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